TAICAR-The Collection and Annotation of an In-Car Speech Database Created in Taiwan
نویسندگان
چکیده
This paper describes a project that aims to create a Mandarin speech database for the automobile setting (TAICAR). A group of researchers from several universities and research institutes in Taiwan have participated in the project. The goal is to generate a corpus for the development and testing of various speech-processing techniques. There are six recording sites in this project. Various words, sentences, and spontaneously queries uttered in the vehicular navigation setting have been collected in this project. A preliminary corpus of utterances from 192 speakers was created from utterances generated in different vehicles. The database contains more than 163,000 files, occupying 16.8 gigabytes of disk space.
منابع مشابه
Subspace-Based Speech Enhancement with Perceptual Filterbank and SNR-Aware Technique
In this paper, a new subspace-based speech enhancement algorithm is presented. First, we construct a perceptual filterbank from psycho-acoustic model and incorporate it with the subspace-based enhancement approach. This filterbank is created through a five-level wavelet packet decomposition. Next, the prior SNR of each critical band are taken to decide the attenuation factor of the optimal line...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملMultiband Subspace Tracking Speech Enhancement for In-Car Human Computer Speech Interaction
In this paper, a new subspace-based speech enhancement algorithm for in-car human computer speech interaction is presented. We first incorporate a perceptual filterbank which is derived from psycho-acoustic model with signal subspace approach to effectively suppress in-car noises of engine. Second, for real-time applications, a new subspace tracking algorithm is derived by modifying PASTd algor...
متن کاملSPEECHDAT-CAR. A Large Speech Database for Automotive Environments
The aims of the SpeechDat-Car project are to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, each language 600 sessions will be recorded (from at least 300 speakers) in seven characteristic envir...
متن کاملFuzzy Neighbor Voting for Automatic Image Annotation
With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IJCLCLP
دوره 10 شماره
صفحات -
تاریخ انتشار 2005